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Abstract. We introduce a colored coalescent process which recovers random colored 
genealogical trees. Here a colored genealogical tree has its vertices colored black or 
white. Moving backward along the colored genealogical tree, the color of vertices may 
change only when two vertices coalesce. Explicit computations of the expectation 
and the cumulative distribution function of the coalescent time are carried out. For 
example, when x = 1/2, for a sample of n colored individuals, the expected time for 
the colored coalescent process to reach a black MRAC or a white MRAC, respectively, 
is 3 — 2/n. On the other hand, the expected time for the colored coalescent process 
to reach a MRAC, either black or white, is 2 — 2/n, which is the same as that for the 
standard Kingman coalescent process. 

1. Introduction. In the last twenty years, coalescent theory has been developed 
into a powerful analytical tool for population genetics. This theory is especially 
significant with the rapid accumulation of DNA sequence data. First formulated 
in the seminal work of Kingman in 1982 0], coalescent theory offers various 
sample-based and highly efficient statistical methods for analyzing molecular data 
such as DNA sequence samples. For recent reviews as well as extensive references 
of coalescent theory, see pQ and UJ. A nice introduction to coalescent theory can 
be found in [S]- 

Mathematically, coalescent theory studies stochastic processes leading to the 
most recent common ancestor (MRCA) from a sample under various coalescent 
models. If one thinks of the more commonly studied branching processes as sto- 
chastic models of generating random trees from their roots, coalescent processes can 
be thought of as the inverse processes which recover random trees from their leaves. 
In a more elaborated version crucial for population genetics, a coalescent process 
is usually superimposed with a mutation process. This mutation process can be 
thought of as an independent Poisson process running on the random tree gener- 
ated by the coalescent process, with the edge lengths of the random tree serving as 
the time scale for the mutation process. 

In this article, we introduce a coalescent process which generated random colored 
trees. Here a coloring of a tree is to color the vertices of the tree by two colors, 
black (B) and white (W), such that if two vertices are joint by an edge, they may 

Key words and phrases, genealogical tree, coalescent theory, colored coalescent theory, cumu- 
lative distribution function, random tree. 
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have different colors only when the vertex closer to the root is a branching point of 
the tree. Thus, in recovering a random colored tree using this coalescent process, 
we may end up at a colored tree with the root colored black or colored white. The 
quantities which we are interested in include the following: the probabilities for the 
coalescent process to reach a black or white root, respectively; the mean and the 
cumulative distribution function of the coalescent time, which is the time elapsed 
before the coalescent process reaches a black root or a white root, respectively; etc. 

Our goal in the mathematical development of colored coalescent theory is to 
understand the geographical models of the origin of the human being. Let us take 
two sets of gene sequences from the different geographical locations. Designate 
each by different color. We then mix them as a population sample. Based upon 
the sample, we look at which location the most recent common ancestor was arisen 
from. Our theory can give a answer theoretically. We hope to incorporate genetic 
data analysis of gene genealogies into the colored coalescent framework in our future 
study. 

The Wright-Fisher model in population genetics assumes discrete, non-overlapping 
generations Go, G±, G2, • ■ ■ in which each generation contains a fixed number N of 
individuals. In a so-called haploid population, each member in G^+i is the child of 
exactly one member in Gi, but the number of children born to the j's member of 
Gi is a random variable Vj satisfying the symmetric multinomial distribution 

N\ 

Vr{uj = nj ;j = l,2,...,N}= —— — ^. 

ni!n 2 ! • un'-N" 



We additionally assume that each individual in a generation has two possible 
colors B and W . In the next generation, if a member is the only child of its parent, 
then this child will inherit the color of its parent. But when a parent has more 
than one child in the next generation, the color of children of that common parent 
satisfies a binomial distribution. More specifically, for a parent with k children in 
the next generation, k > 1, let b be the number of children with B color and w be 
the number of children with W color (so that b + w = k), we have 



Pi j/i = /,• 1 . tr = },■■> : ! Ik- parent has color D\ = ( ^ )p fel (l — /;)* 

k 

h 



Pr{6 = ki, w = k 2 ; the parent has color W} = ( ) (1 - q) kl q k2 



(1) 



where < p, q < 1. 

Following the same argument as in 00], we have a limiting coalescent process 
for a sample of n colored individuals when N — > 00. In this limiting coalescent 
process, one only allow to have two individuals in the sample to coalesce. When 
two colored individuals coalesce, the probability of the color of their common parent 
can be calculated according to Equation QJ. Assuming that we may express those 
probabilities of various cases in the following multiplication rules: 

BB = xB + (1 -x)W 

BW = (1 — x)B + xW (2) 
WW = xB+(l- x)W, 

then we must have p = 1 — q and x = 1/2. For other values of x, the coalescent 
process itself is still defined, although we no longer have it as the backward process 
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of a branching genealogical process. Throughout this article, we will assume that 
< x < 1. We call this limiting coalescent process the colored coalescent process. 
See Figure 1 for an example of a colored genealogical relation. 



The stochastic character of a state of the colored coalescent process with the 
parameter x turns out to depend only on the number of individuals in this state 
colored by B. If we start the colored coalescent process with a sample of n colored 
individuals, the initial state can be denoted by a pair of non- negative integers (k, ri- 
te), where k is the number of individuals colored by B and n — k is the number of 
individuals colored by W . The absorbing states of the colored coalescent process 
are (0, 1) (a white root) and (1,0) (a black root), respectively. Our main result is 
Theorem 2, which states the coalescent probabilities P( ni ,n 2 )(o,i) anc ^ ^ > (ni,n 2 )(i,o) 
from different configurations of the sample to different functions of the colored 
coalescent times, T( ni „ 2 )( ,i) and T , (ni,n 2 )(i,o)- F° r example, when x = 1/2, the 
probabilities to reach (0,1) and (1,0), respectively, are both 1/2. The mean time 
to reach (0, 1) is 3 — 2/n, and the mean time to reach (1,0) is 3 — 2/ra. Finally, 
according to Lemma 2, the mean time to reach either (0,1) or (1,0) is 2 — 2/n, 
which is the same as in the classical coalescent theory. This shows the difference 
and similarity between our colored coalescent theory and the classical coalescent 
theory. 

In Section 2, we discuss in details the colored coalescent process. The technique 
of lumping turns out to be very important to simplify computations involved. So we 
also include in Section 3 a general discussion about lumping of Markov processes. 
In Sections 4 and 5, we use the lumping technique to calculate various statistic 
parameters of the colored coalescent process. In the paper jS], a comprehensive 
coalescent model which could accommodate the situation of mutations is studied. 



2. A colored coalescent model and some of its basic parameters. We start 
with the standard Kingman coalescent process [21 EJ- So we have a sample of 
n individuals. In a unit time At, two of these n individuals may coalesce with 
probability OjJAt. Let At — > 0, we arrive at a continuous stochastic process where 
the probability that no coalescent event happens within the time interval [0, t] is 



Here a coalescent event is the event that two of the n individuals coalesce into one. 
Thus the expectation of the time r n before the first coalescent event happens is 




Figure 1. A colored genealogical relation. 
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After the first coalescent event, the process will continue with a new generation 
of n — 1 individuals. The expected time elapsed between the first coalescent event 
and the second one is then B(r n _i) = ("g 1 ) ■ The process continues with the 
number of individuals in each generation decreases consecutively from n to 1, when 
a most recent common ancestor (MRCA) is reached. The expected time elapsed 
between the A:-th coalescent event and the (k — l)-th coalescent event is E(jk) = (2) , 
for k = n, n — 1, . . . , 2. The random variable 

n 
fc=2 

is the time elapsed before the coalescent process reaches a MRCA. Its expectation 
is 

k=2 v 7 

We now add an additional feature to this coalescent process by assuming that 
an individual in each generation can have two colors, black (B) and white (W). At 
the initial stage of the process, each of the n individuals in the current generation 
are given a color, B or W. The process then runs as before. When two individuals 
coalesce, the color of their common ancestor is determined by Equation (J2J. In 
other words, when an individual colored by B and another individual colored by B 
coalesce, the probability that their common ancestor is colored by B is x and colored 
by W is x :— 1 — x. Other situations are interpreted similarly. Furthermore, non- 
coalescent individuals will keep their colors unchanged after an coalescent event. 

A typical way to study the coalescent process is to consider a death process on 
the lattice points n, n — 1, . . . , 2, 1 on a line, with the time spending on the point k 

exponentially distributed with the mean ( 2 ) . We will do the same for our colored 
coalescent process. 

Fix an integer n > 0. Consider a proper subset £ of the plane integer lattice: 

£ = {(k, I) e Z x Z; k > 0, 1 > 0,0 < k + 1 < n}. 

A point (k, I) £ £ represents a colored generation of k + l individuals, with the num- 
ber of £?-colored individuals equal to k, and the number of IV-colored individuals 
equal to I. 

Let's denote by Z(t) the Markov process with the state space £ and the infini- 
tesimal generator Q = (q^n) given by the following formulas: 



'*(a)> 


if n = (k - 2, 1 + 1), given C = (A, l)\ 


ac(*) +xkl, 


if n=(k- 1,1), given C = (k, I); 


x( 2 ) + xkl, 


if 7] = (k,l- 1), given C = (k, I); 




if r]= (k + l, I- 2), given ( = (k, I); 




if X] = (k,l), given ( = (k,l); 


k 


otherwise. 



(3) 



Here and in what follows, we take (^j = if n < m. 

Notice that Q is a square matrix with the size n(n + 3)/2. We write it in the form 
of a block matrix with diagonal (— r n I n +x, — r n _iJ n , — ^Ia), O2) an d updiagonal 
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(B n +i,n,Bn, n -i,—,Ba t 2), where r k = ( 2 ), h is the k x k identity matrix, 2 is 
the 2x2 zero square matrix, and Sfe,fe-i is a k x (fc — 1) matrix representing the 
transition (coalescent event) from the diagonal A k -i = {(li,h) G £ ; h +h = k— 1} 
to the diagonal Afc_ 2 = {{I1J2) G £ ; Zi + Z 2 = k — 2} of £. All other entries of Q 
are zero. 

Associated with the colored coalescent process Z(t), there is a discrete time 
Markov chain called the jump chain of Z(t). Its has the same state space C as that 
of Z(t) and its probability transition matrix is denoted by J. 

The jump chain of Z(t) has two absorbing states (0,1) and (1,0). When the 
coalescent process Z(t) undergoes k coalescent events, the associated Markov chain 
moves k steps. Thus, if we starts at a state (rii,n 2 ) G C with m + n 2 = n, the 
probability of arriving at a state (h,h) G £ with h + h = n — k can be read out 
from a row of J k . 

We denote 

C n ,n-k+l ■= r~ 1 r n _ 1 • • • f n _ k+1 B n +l,n ' • • S„ ; „_iS„_fc + 2, n -fe+l, 

for k < n — 1. This is a (n + 1) x (n — k + 1) matrix. To fix this matrix, we assume 
that the states are listed as (0, n), (1, n — 1), (2, n — 2), ... , (n, 0), (0, n — 1), (1, n — 
2),..., (0,1), (1,0). 

Lemma 1. If Z{t) starts at (721,712), w/iere ni + n 2 = n, the probability that it 
reaches {h,h), where l\ +I2 = n — k, after k coalescent events is the (m + + 1)- 
entry ofC n , n - k +\- 

Notice that in particular, if Z(t) starts at (ni,n 2 ), where n\ + n 2 = n, the 
probability that it reaches (0, 1) is the {n\ + 1, l)-entry of C„ j2 , and the probability 
that it reaches (1, 0) is (ni + 1, 2)-cntry of C„. 2 . This means that after n—1 coalescent 
events, the process Z(t) will be absorbed by either (0, 1) or (1, 0) and the absorption 
probabilities are given by the matrix C„ j2 . 

Proof. By a direct computation, we see that the block of J k corresponding to the 
transition from A„ to A„_fe is exactly C nifl _k+i. Then starting at a state (ni,n 2 ) 
with ni+n 2 = n, the (ni + l)-th row of C n; n-k+\ give us the probability distribution 
of the process Z(t) reaching the state (h,h) with h + l 2 = n — k after k coalescent 
events. □ 

We calculate next the expectation of the time TV for the coalescent process Z{t) 
to reach an absorbing state, given that it starts at an initial distribution 7r on A„. 
For a non-absorption state (k,l) G C, let a^/^n be the the sojourn coefficient of 
(k, I), which is the expected number of times the associated jump chain of Z(t) to 
visit the state (k,l), given that it starts at a distribution 7r on A n . Furthermore, 
let uj(k,i) be a random variable distributed exponentially with the mean rj^j. The 
random variable U(k,i) is the time before the process Z(t) to reach a state in Ak+1-1, 
given that it starts at the state (fc, I). These random variables ^(k,i), {k,l) G C and 
k + I > 2, are independent. Finally, we define a random variable 

n 

Trr = X] X! a ^,(fe,0 w fe.' ( 4 ) 

m=2 fe+J=m 
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Lemma 2. The expectation of the coalescent time is given by 

" 1 2 
E(T 7r )=J2 — =2--- 

Proof. The Feller relation (see [7]) says E(T W ) = E(T V ). By a calculation involv- 
ing the fundamental matrix for the associated Markov chain, it turns out that 
Tlik+i=m a it,{k,l) = 1 f° r eacn Ti > 2. (This matrix computation is straightforward 
but quite tedious. On the other hand, we shall see very easily while it is true in 
the proof of Theorem (2J) Since E(u>k,i) — r k+i> we S e t the desired expectation of 

IV. ' □ 



3. The lumpability of Markov processes. The full details of the coalescent 
process Z(t) is hard to compute in general. For example, there are two absorption 
states, (0, 1) and (1, 0). The coalescent time TV is the time elapsed before the process 
Z(t) reaches any one of these two states. Can we compute the coalescent time to 
a particular one of these two states? In order to do this, the method of lumping 
seems to be quite suitable. In what follows, we will show that Z(t) can be lumped 
into another Markov process. For this lumped Markov process, the computation of 
most of its interesting parameters can be made explicit. For that purpose, we need 
to discuss the notion of lumpability of a Markov process first. 

Let X n be a Markov chain with finite state space S = {e±,e2, ,e r }, and 

S = {E\, E2, , E v } be a partition of S. Let pij be the transition probability 

from a to ej. The probability that the chain moves into the set E v in one step, 
given that it starts at a, is equal to X^e-E Pik- We sa y that the Markov chain X n 
is lumpable if for , ej G E^ , 

zl >'■>■ = zl i'- 1 - 

When the chain X n is lumpable, we define a Markov chain X n on S with the 
transition probability from E^ to E^ equal to p^ v = S efc e_B^P ife ' f° r e i ^ E^. This 
chain X n is called a lumping of X n . See 

Now, we consider a Markov process X(t) on S. Let P(t) — {pij(t)) be the 
probability transition matrix of X(t). We define that X{t) is lumpable if for e^, ej S 

E P*(*)= E ^ fe W' f oralU>0. 
Suppose that X(t) is lumpable. We define 

for e, € 2%. Let P(t) = (p^ v (t)). This is an u x u matrix. 
Lemma 3. P(t) defines a Markov process on S. 
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The proof is to check the property that P{t)P{t + s). It can be done directly by 
computation. We call the Markov process X(t) on S with the probability transition 
matrix P(t) a lumping of X(t). 

Let Q = (qik) be the infinitesimal generator of X(t), i.e. P(t) = e f ®. We say 
that Q is lumpable if 

E * fc = E ^ k 

for a, ej £ E%. 

We need to introduce some notations in order to prove the following theorem 
which relates the lumpability of Q with that of P(t) = e*^ . Recall that the finite 
set of states S = {e\, e%, . . . , e r } is partitioned into S = {Ei,E 2 , ■ ■ ■ , E v }. Let U be 
the v x r matrix whose £-th row is the probability vector having equal components 
for states in E^, £ = 1,2 ... ,v, and elsewhere. Let V be the r x v matrix with 
the 77-th column a vector with l's in the components corresponding to states in E^, 
i] = 1,2, ... ,v and elsewhere. One can check by a direct computation that UV = 
I v and that the Markov process X(t) is lumpable if and only if VUP(t)V = P(t)V . 
(This statement generalizes a theorem about lumpability of Markov chains in j2j.) 
Furthermore, when P(t) is lumpable, we have the P(t) = UP(t)V for the lumping 
process X(t). Similarly, the infinitesimal generator Q of X(t) is lumpable if and 
only if VU QV = QV. And when Q is lumpable, its lumping is Q = U QV. 

Theorem 1. A necessary and sufficient condition for the Markov process X(t) to 
be lumpable is that its infinitesimal generator Q is lumpable. When Q is lumpable. 
we have P(t) = e*^ . 

The intuitive idea of this theorem is clear, we therefore will not give the detailed 
proof here. 

4. Parity lumping of the colored coalescent process. The state space C of 
Z(t) is partitioned into diagonals A m , m — 1, 2, . . . , n. We will divide each A m into 
two disjoint subsets, O m and E m . A state (fc, I) G O m when k + l = m and k is odd 
and (fc, I) G E m when k + I = m and k is even. Let 

£ = {O m , E m ; m = 1, 2, . . . , n}. 

We will define a new Markov process on £ obtained by a lumping of the process 
Z(t). For that purpose, we need to check the lumpability of Z(t). Using Theorem 
^ we only need to check the lumpability of the infinitesimal generator Q = (q^n) 
given by Equation Q. 

Let C = (fc, I) £ O m , then k + I — m and k is odd. We have 
and 

E i «Cn = »(*) +*kl + xfy =«(a)- 

Both of them are independent of k. Other lumpability conditions for Q can be 
checked similarly. So, by Theorem ^ Z(t) has a lumping Z(t), which is a Markov 
process on C with the infinitesimal generator Q. We will order the elements in C by 
E n , O n , E n —i, O n -\, . . . , E2, O2, Ei, 0\. Under this ordering, Q is a block matrix 
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with diagonal {—r n l2, — T"2-?2, 02) and updiagonal (r n C, ...,rzC), where C is a 2x2 
matrix: 



C 



x x 

X X 

We will follow the same idea as in Section 2 to compute basic parameters of the 
lumped coalescent process Z(t). We shall be able to get more complete information 
about the process Z(t) than about Z(t). 

(1) The fundamental matrix of the jump chain of Z(t). The probability transition 
matrix of the jump chain of Z(t) is a block matrix with each updiagonal entry C and 
each diagonal except the last Ii- Notice that the states E\ and 0\ are absorbing 
states. So the fundamental matrix iV of the jump chain can be easily calculated. 
This is a block updiagonal matrix, each of whose diagonal entry is I2, each of whose 
updiagonal entry is C, each of whose second updiagonal entry is C 2 , up to the entry 
C"~ 2 . By a straightforward computation, we may get 
" i^i(l-2x) fe I-I(l-2x) fe " 



C k = 



\-\(l-2x) k ! + i(l-2x) fe 



(2) The sojourn coefficients of the jump chain. The sojourn coefficients (respec- 
tively, bk) is the expected number of times the jump chain J of Z(t) to visit the state 
Ek (respectively, Ok), given that it starts at an initial distribution n = (n E ,no) on 
the states {E ni O n }. 

Lemma 4. We have 

(<*fcA) ={l + l(^- tto) (1 - 2x) n -\ \ + \ (n n E ) (1 - 2.*)- fe ) (5) 

Proof. The sojourn coefficients (ak,bk) can be calculated as follows: 

(Ofc.&fc) = (TT E ,TT )C n - k 

= (n E ,no){ l_l {1 _ 2x)n - k l + l {l _ 2x) n-k 

\ + ttq) (1 - 2x) n -\ \ + \ (no n E ) (1 - 2x)- fc 

□ 

(3) Coalescent probability for the lumped coalescent process Z(t). Since E\ and 
0\ are absorbing states, the coalescent probability P^.e (respectively, Ptt,o) is the 
probability of the jump chain to arrive at E\ (respectively, Oi), given that it starts 
at a distribution n = (ir E , no) on the initial states {E n , O n }. 

Lemma 5. We have 

(Pe,Po) = Q + \ (n E - tto) (1 - 2a;)"- 1 , \ + \{*0 ~ *e) (1 - 2x) n ^ 

In particular, let Pe,e be the probability of reaching E\, given that the process starts 
at E n , and other quantities Pe,o, Po,e, Po,o be defined similarly, then we have 

Pe,e = \ + \{l - 2x)-\ P E . = \-\(l- 2x) n -\ 

11 11 (6) 

Po.E = 2 - - 2x) n ~\ Po,o = 2 + - 2a:)"- 1 
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Proof. Since a minimal Markov process on a finite set of states and its jump chain 
have the same character of states, E\ and 0\ are absorbing states for both Z{t) 
and J. So we have {P-n ,e , P-k ,o) — {ai,b\). □ 

(4) The expected coalescent time for the lumped coalescent process Z{t). Since E\ 
and 0\ both are absorbing states of the process Z(t), the coalescent time T Vi e to 
the state E\ is a random variable, which is the time elapsed before the process 
Z(t) reaches E\, given that it starts at the initial distribution 7r = (tte^o) an d 
conditional on Z{t) not reaching 0\. Similarly, we have the coalescent time T^o to 
the state 0\ as a random variable. To calculate the expectation of the coalescent 
time, we will define a random variable T v> e first and calculate its expectation. 
Then the Feller relation will tell us that the expectation of T n ,E is exactly the same 
as the expectation of the coalescent time T n>E to the states E\. I. e., we have 

E{T v ,e) — E(T^^e)- 

Let Tfc be a random variable distributed exponentially with the mean , which 
is the time elapsed before the lumped coalescent process Z(t) moves either to Ek-i 
or Ok-i, given that it starts at Ek (or Ok). Recall that the sojourn coefficients 
afc (respectively, bk) is the expected number of times the jump chain J of Z(t) to 
visit the state Ek (respectively, Ok), given that it starts at a initial distribution 
7T = (7T£,7To) on the initial states {E ni O n }. Then the random variable Tk,e is 
defined by 

n n , 

~ v— v v— v 0,2 t>2 

Tk,E = a k T k + °k T k + —T2 H T 2 

k—3 k—3 ^-y^ 

= E 7 * + i C 1 + fa - *E)(1 T2. 
k=3 

Similarly, we may define a random variable T^.o, whose expectation turns out 
to be the same as the expectation of the coalescent time T^.o to the state 0\: 

n 1 

T*,o = zZ^k + ^- (1 + (tte - 7T )(1 - 2a;)"- 1 ) r 2 . (8) 

k=3 



Lemma 6. The expectation of the coalescent time to E\ and 0\ are 
E(T^.e) = 1 - - + (1 + (ttq - 7T B )(1 - 2a;)"- 1 ) 



and 



E(T^ ) = 1 - 1 + -L (l + - 7ro )(l - 2*:)- 1 ) 



respectively. 

Proof. The expectations 



= 1 - - + ^- (1 + (tto - ttb)(1 - 2a;)"- 1 ) 

71 AXX 

E(f^ ) = 1 - - + -1- (1 + (tt b - 7T )(1 - 2a;)"- 1 ) 

71 AXX 
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are easy to compute since we know that E(rk) = r k and Tfc's are independent 
random variables. To see why 

E(f n , E ) = E{T^ E ), 

we define a conditional process whose state characters are all the same as that of 
the process Z(t) except for the states E 2 and 2 . At these two states E 2 and 2 , 
the conditional process will move only to Ei with the mean of holding time =^ and 

-^q, respectively. Then, by the Feller relation, E(T 7I _ E ) is the expectation of the 
coalescent time for this conditional process to reach E\. On the other hand, the 
coalescent time for the conditional process to reach E\ is exactly T^^e- So we have 
E(T VtE ) = E(T VtE ). Similarly, we have E(T„ tQ ) = E( T *,o)- □ 



(5) The cumulative distribution function of the coalescent time of Z(t). The com- 
putation of the complementary cumulative distribution functions Pr {T^.e > t} and 
Pr{T7r,o > t} is rather involved. So we will put it in the appendix and give here 
only the results. 

We need to introduce some functions of a; first. They are K n2 , K n 2 i, K n i 2l and 
Kn',2'- Here the indices 1, 2, . . . , n are for the states Ei, E 2l ...,£"„ and the indices 
1', 2', . . . ,n' are for 0\, 2 , ■ ■ ■ , O n . These functions are given as follows: 

/ \ n— 3 n — 3 



2 2 / j-^ r n - k -xr 2 AA r n -i - r n _ fe 

' fe=0 t=0 
\ n— 3 n— 3 

K n ,v = ( \ - \{l - 2xY-A £ ""^ I] ^ ■ 
v 2 2 J r„_ fe - xr 2 A± rn _ t _ r?i _ fc 

■ -. \ n— 3 n — 3 

= ( J - ki - 2X)"- 2 ) e r - fc _ n r "-* , 

/ k— «— 

• 1 \n — 3 n — 3 

i /i n \n — 2 \ \ A Tn—k 1 I Tn—i 



(-. w \ ft — O 7t — O 

+ (1-2HE r "~ fc n 



„ T'n — k XT 2 . T n —i 7*n—k 



(9) 



We denote with 7Te = l,7ro = by Te,e- Other coalescent time To,e, 

Te.o, an d Tq.o have the same meaning. 



Lemma 7. We have 



Pv{T E , E >t} = 



n — 3 n — 3 

r„-j - r„_ fe \ xr 2 - r n - k xr 2 - r n - k 



k=0 i=0 



ETT 7 n-i I 2 _j_ 2 V J ' _| 



+ K na e- xr * + K n , 2 ,e- xr2t 
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and 

Pr{T .E >t} = 

y tt r n ^ ( x(\ + - 2.x)- 2 ) | x(\ 1(1 - 2.)- 2 ) - ) t , „ 



Also, Vi{T E . >t} = Py{T .e > t} and Pr{T .o > t} = Pr{T E , E > t}. 



5. Back to the colored coalescent process Z(t). For the parity lumping of the 
colored coalescent process, we have a commutative diagram 



z(t) jump chain > J 



lumping 



lumping 



z(t) jump chain > 7. 

This can be shown by a simple matrix computation. Notice that, for an arbitrary 
lumpable Markov process, we do not know if this diagram is commutative or under 
what condition this diagram is commutative. 

The commutativity of the above diagram for the parity lumping of our colored 
coalescent process Z(t) provides a way for us to recover information about Z(t) from 
our knowledge of the lumped coalescent process Z(t). Due to the fact that both 
processes Z(t) and Z(t) have the same absorbing states, we can achieve a complete 
recovery of information for certain parameters of the coalescent process Z(t). 

Let N be the fundamental matrix of the jump chain J, of Z(t), and N be the 
fundamental matrix of the jump chain J of Z(t). Notice that the absorption states 
of J is E\ = {(0,1)} and 0\ = {(1,0)}. Let Uq and Vb be matrices obtained 
from U and V by dropping the last two rows and columns corresponding to the 
states (0, 1) and (1, 0). Then it is easy to verify that N is lumpable by Uq and Vb: 
V U NV = NV . Furthermore, we have N = U NV . 

The following is the main theorem of this section. 

Theorem 2. Let P( ni ,n 2 )(o,i) an d P( ni ,n 2 )(i,o) be the probabilities that the coalescent 
process Z(t) reaches (0,1) and (1,0), respectively, given that it starts at (711,712), 
m + ri2 = n. Then we have 

p _ \ 1 + l( l - 2x ) n ^ ^ if rn is even, 

)(o,i) " \| - |(1 - 2x)«-\ ifm z S odd; m 

P _ )l ~ hi 1 ~ 2 a0"~\ if ni is even, 

(ni '" 2)(1 ' 0) " \\ + |(i - 2x)-\ if ni is odd. ( ' 

Furthermore, let T( ni .„ 2 )(o.i) (orT^j^jp^)) be the time for the coalescent pro- 
cess Z(t) to reach (0,1) (or (1,0)), given that it starts at (711,712), n\ + n 2 = n. 
Then the expectations of T( ni ,n 2 )(o,i) an d T( ni ,n 2 )(i,o) are gi ven as follows: 

E(T/ w ^_ /l-^ + Si(l-(l-2«) n - 1 ), if mis even, 

( ( — ~ \i - 1 + & (1 + a - ^r- 1 ) . if* » odd; (12) 
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E{T( ni ,n 2 )(l,0)) 



'l-f + 2is(l + (l-2^)"- 1 ), ifnii 



is even, 



1 - I + 21b i 1 - (! - 2a; )" _1 ) . zs 0fM 



(13) 



Proof. The absorption probabilities of the jump chains J and J are calculated from 
the matrices 





" 




" 




" 




" 


N 




and N 




= U Q NV 




= U N 






















c 




C 




c 




c 



respectively. So P( ni na)(o,i)) f° r n i + ?1 2 = ti and n\ even, is equal to Pe,e- By 
Equation JJJ, we get the desired value for P(„ 1 .„ 2 )(o.i) in this case. All other cases 
can be obtained similarly. 

To compute the coalescent time, we first denote by a(m,n 2 )(fci,fc 2 ) the sojourn 
coefficient of the jump chain J, which is the expect number of times the jump chain 
J visits the state (k\,k2), given that it starts at the state (ni,na), n\ + n-i = n. 
Since the jump chain J is lumpable and its lumping is the jump chain J, we have 



^ a (ni,™ 2 )Oi,fc2) - a k 

ki~\-k,2=k, k\ even 

^ a (ni,n2)(fci,fc2) = b k' 

ki~\-k2—k, ki odd 



(14) 



where au and bk are the sojourn coefficients of J corresponding to the expected num- 
ber of times J visits Ek and Ok, respectively, given that it starts at the distribution 



(1, 0), if n\ is even, 
(0, 1), if m is odd. 



(15) 



Thus, we have 



(ni,n 2 )(0,l) 



a (rn,n 2 )(l,l) 



T2 



k=3 V fei+/c 2 =fc / 

. a (ni,n 2 )(0,2) a (m,n 2 )(2,0) 

H — T2 H — T 2 

X X X 

n n , 

Ev — > &2 ^2 
dkTk + h k T k + —T 2 H T 2 

fc=3 fe=3 
= T ni E, 

where it is the distribution given by Equation i|15|) . So Lemma 6 gives us the 
desired expectation of T'(n 1 ,„ 2 )(o,i)- The calculation of £7(T( n na v 1(0 )) is exactly the 
same. □ 



Remark 1. Equation (|14|l shows Ylk+l=m a -n.(k,i) = 1 as claimed in the proof of 
Lemma |2| 
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Remark 2. For (ni,ri2) € A n with n\ even, it is obvious that T( ni „ 2 )(o.i) = Te.e- 
Thus Lemma 2.7 also gives us the complementary cumulative distribution function 
of T(ni.n 2 )(o,i)- All other cases are the same so that the cumulative distribution 
functions of T(n\, n2)(0, 1) and Tr ni<n2 \M m are all known. 

There is, of course, information of the coalescent process Z(t) which can not be 
recovered from what we know about Z(t). For example, recall that the parity of a 
state (k, I) £ £ is the parity (even or odd) of the integer k. Let us define p k to be 
the parity of a state after the fc-th coalescent event. In other words, p k is a random 
variable which takes the value if the coalescent process Z(t) is in Ek and takes 
the value 1 if the coalescent process is in O k , after the fc-th coalescent event. We 
will use po to denote the random variable which takes the value if the parity of 
the initial state is even and the value 1 otherwise. Then we have 



Pr(p k 


= 


!po 


= 0) = 


1 

2 




- 2x) 


Pr(p k 


= 1 


Po 


= 0) = 


1 
2 




- 2x) 


Pr(p k 


= 


Po 


= 1) = 


1 

2 




- 2x) 


Pr(p k 


= 1 


Po 


= 1) = 


1 

2 


+ \* 


- 2x) 



(16) 



From Equation Ijltjfl . we can get the probability of sequences of n — 1 coalescent 
events with the parities of the passing states all specified. For example, the prob- 
ability that all of n — 1 coalescent events happen in states with even parity, given 
that the parity of the initial state is also even, is 

Pr(pfc = 0;fc = 0,l,2,---,n-l) = n - + -(l-2x) fe j . 

k=0 > ' 

The probability of other sequences of coalescent events with specified parities can 
be calculated similarly. 

On the other hand, starting at {ni,ri2), where Ui + ni = n, after k coalescent 
events, we do not know exactly the distribution of the states (Zi, I2) , h + h = n — k. 
The knowledge of the distribution of the parities of these states is not enough for 
recovering the distribution of these states. 
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